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36. A multi-gene expression profile of a sample comprising a hybridizing 
substrate to which is hybridized a collection of amplified specific nucleic acid messages, 
which specific nucleic acid messages having been amplified in vitro simultaneously with 
5 RNA polymerase and a single primer linked to an RNA polymerase promoter, wherein 
said amplified specific nucleic acid messages each has a relative abundance which 
reflects the relative representation of a given sequence relative to other sequences in the 
collection of specific unamplified nucleic acid messages within the sample. 

10 37. The multi-gene expression profile of claim 36, wherein the hybridizing 

substrate is a nylon membrane. 

38. The multi-gene expression profile of claim 35 wherein the hybridization is by 
northern or Southern blot. 

15 

39. The multi-gene expression profile of claim 35 wherein the sample is brain, 
spleen, bone, heart, vascular tissue, lung, kidney, liver, pituitary, endocrine glands, lymph 
node, dispersed primary cells, or tumor cells. 

20 40. The multi-gene expression profile of claim 35 wherein the sample is blood 

and the expression of hemoglobin is reduced relative to the other specific messages and is 
diagnostic of thalassemia. 

45. A multi-gene expression profile of a single cell comprising a hybridized 
25 collection of amplified specific nucleic acid messages, 

which specific nucleic acid messages are amplified in vitro simultaneously with 
an RNA polymerase and a single primer linked to RNA polymerase promoter, 

wherein said multi-gene expression profile consists of said specific nucleic acid 
messages, each having a relative abundance which reflects the relative level of 
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representation of a given mRNA sequence relative to other mRNA sequences within the 
cell. 

CONCLUSION 

5 In view of the foregoing amendments, Applicants believe all claims now pending 

in this application are in condition for allowance. 
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PROCESSES FOR GENETIC MANIPULATIONS USING PROMOTERS 

Field of the Invention 

The present invention relates generally to the field 
of recombinant DNA technology and, more particularly/ to 
improved methods for producing amplified heterogeneous 
populations of RNA from limited guantities of cDNA or other 
nucleic acids. 

BACKGROUND OF THE INVENTION 
Selective amplification of cDNA's represents a major 
research goal of molecular biologists, with particular 
importance m diagnostic and forensic applications, as well as 
for general manipulations of genetic materials. 

In many important areas of research, such as m 
studying gene regulation in complex biological systems (e.g., 
the brain) having multiple phenotypes , obtaining sufficient mRNA 
for the isolation, cloning, and characterization of specific 
regulated transcripts is problematic. Research has been 
hindered by, e.g., the high complexity of the mRNA, the 
relatively low abundance of many important expressed messages, 
and the spatially limited expression of these messages. In 
particular, the identification and cloning of novel regulated 
messages from discrete cell populations has proven to be a 
formidable task. 

The polymerase chain reaction (PCR) is an extremely 
powerful technique for amplifying specific nucleic acid 
sequences, including mRNA in certain circumstances. As 
described in U.S. Patent Nos, 4,683,202 and U.S. 4,683,195 (both 
of which are incorporated herein by reference) , PCR typically 
comprises treating separate complementary strands of a target 
nucleic acid with two oligonucleotide primers to form 
complementary primer extension products on both strands that act 
as templates for synthesizing copies of the desired nucleic acid 
35 sequences. By repeating the separation and synthesis steps in 
an automated system, essentially exponential duplication of the 
target sequences can be achieved. 
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PCR, nowever, aas several well-known limitations. 
For example , PCR typically requires that 5 1 terminus and : ' 
terminus sequence information be known for the synthesis of the 
primers. Recently; homopoiymeric tailing of the 3 1 terminus 
(see Fronman, et ai . , Proc. Natl. Acad. Sci. USA 85: 3998-9002 
(1988) and Eberwme et al., Neuroscience Short Course I (Society 
for Neuroscience) 69-81 (1988)) and the synthesis of highly 
degenerate nucleotide primers (Gould et al . , Proc. Natl. Acad. 
Sci. USA 8_6: 1934-1938 (1989)) have been implemented to improve 
the range of cDNAs that can be cloned with PCR. An additional 
problem is the low fidelity of the most widely used enzyme in 
PCR f Thermus aauaticus ( Tag ) polymerase. This characteristic 
of Tag results in misincorporations that are propagated through 
the subsequent cycles of PCR amplification — ultimately 
producing faulty cDNA libraries. Also, sequences longer than 
3 kilobases create difficulties in Tag transcription, which can 
skew cDNAs to smaller sizes during amplification. Of course, 
unless modified, PCR provides amplification by DNA replication 
and not by transcription. 

In this regard, Sarkar et al . , Science 244: 33 1-34 
(1989), recently described a method, called RAWTS (RNA 
amplification with transcript sequencing) , for detecting 
extremely low abundance messages in heterologous cell types. 
This method is a modification of GAWTS (genomic amplification 
with transcript sequencing; see, Stofler, et al., Science 2^9: 
491 (1988)), which incorporates a phage promoter into at least 
one of the two primers used in PCR . In RAWTS , mRNA is amplified 
by PCR. A phage promoter incorporated into the PCR 

oligonucleotide primer allows abundant transcription, from which 
RNA can be sequenced directly. 

Four steps are used in RAWTS: (1) first strand cDNA 
synthesis from total RNA or mRNA using oligo(dT) or an mRNA- 
specific oligo primer, dNTPs , and reverse transcriptase; (2) 
PCR, wherein one or both primers contain a T7 phage promoter 
attached to a sequence complementary to the region to be 
amplified; (3) transcription of the cDNA strand with T7 RNA 
polymerase; and (4) reverse transcriptase-mediatea dideoxy 
sequencing of the resultant mRNA transcript. 



In spire of sucn recent: aavances, .nciuding PGR ana 
its various modifications noted a^ove, there exists a need for 
improved methods of identifying and cloning mRNAs and of 
accurate in vitro amplification of selected cDNA's. The methods 
should produce about 100-fold or more amplification of 
heterogeneous populations of RNA from limited quantities of 
cDNA. Preferably, the overall methodologies will be capable of 
replicating a broad range of messages without prior cloning into 
vectors and without knowledge of sequence in some instances. 
The present invention fulfills these and other needs. 

SUMMARY OF THE INVENTION 
The present invention provides novel processes for 
amplifying at least one sequence m a collection of nucleic 
acids sequences, the processes comprising ( i) synthesizing a 
nucleic acid by hybridizing a primer complex to the sequence and 
extending the primer to form a first strand complementary to the 
sequence and a second strand complementary to the first strand, 
wherein the complex comprises a primer complementary to the 
sequence and a promoter region in anti-sense orientation with 
respect to the sequence; and (2) transcribing copies of anti- 
sense RNA off of the second strand. The promoter region, which 
may be single or double stranded, is capable of inducing 
transcription from an operably linked DNA sequence in the 
presence of ribonucleotides and a RNA polymerase under suitable 
conditions. Suitable promoter regions are prokaryotes, such as 
from T3 or T7 bacteriophage. The primer is preferably a single 
stranded nucleotide, of sufficient length to act as a template 
for synthesis of extension products under suitable conditions 
and maybe poly(T) or a collection of degenerate sequences. 

In another aspect, the invention is directed to a 
processes for detecting expression of a gene in a preselected 
cell population comprising steps of: 

(a) synthesizing double-stranded cDNA by treating 
mRNAs from the cell populations with a primer complex 
comprising an oligonucleotide complementary to one or 
more of the RNA sequences, the primer linked to a 
promoter region in an orientation capaole of 



directing transcription of anti-sense RNA; 

(b) transcribing the cDNA into anti-sense RNA by 
introducing an RNA polymerase capable of operably 
binding to the promoter region; and 

(c) determining the presence or absence of 
5 transcribed anti-sense RNA complementary to mRNA 

corresponding to the gene. 
The cell population may be, e.g., from a human tissue, such as 
brain nuclei. The cell population may be single cell, or up to 
100 to 1,000,000 cells or more as desired. 
10 In another embodiment, the present invention 

comprises a process for producing a subtractive hybridization 
probe comprising: 

(a) synthesizing a first double-stranded cDNA 
collection by treating a first mRNA population with 

15 a primer complex, wherein the primer is complementary 

to the RNA sequence and is operably linked to a first 
promoter region for transcription of the cDNA strand 
complementary to the primer; 

(b) transcribing the first cDNA into anti-sense RNA 
20 by introducing a first RNA polymerase capable of 

binding to the first promoter region; 

(c) hybridizing the anti-sense RNA to a second mRNA 
population, whereby an unhybridized subpopulat ion of 
the second RNA population is found; 

25 (d) generating a second double-stranded cDNA 

collection from the unhybridized subpopulation using 
a second primer complex comprising a second promoter 
region in an orientation for transcribing anti-sense 
RNA complementary to the unhybridized subpopulation; 
and 

(e) transcribing the second cDNA into a 
ribonucleotide probe by introducing a second RNA 
polymerase capable of binding to the second promoter 
region . 

Additionally, the present invention comprises methods 
for making cDNA libraries from a collection of mRNA molecules 
comprising the steps of: 
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(a) hybridizing one or more primer complexes to a 
plurality of the inRNA's, wherein each complex 
comprises an oligonucleotide primer linked to a 
promoter sequence capable of directing transcription 
of a DNA sequence complementary to the primer; 

(b) producing a collection of double-stranded cDNAs 
by extending the primers of a plurality of any 
hybridization duplexes formed between the mRNA 1 s and 
the complexes wherein each cDNA comprises a first 
strand complementary to one mRNA molecule and a 
second strand operably linked to the promoter 
sequence ; 

(c) transcribing multiple copies of anti-sense RNA 
off of the second strand; and 

(d) preparing a cDNA library from the anti-sense RNA 
copies . 

Importantly, the present invention can be readily 
provided in kit form for a variety of uses. In addition to 
instructions, a kit will typically comprise containers of 
reverse transcriptase, RNA polymerase, and nucleotides which 
may be labelled, such as with radioactive labels (e.g., 14 C, 
3 H, 32 p and the like) . 

BRIEF DESCRIPTION OF THE FIGURE 
Figure 1. Paradigm for production of amplified, 
anti-sense RNA (aRNA) . Whole RNA is reverse transcribed using 
a synthetic primer containing the T7 RNA Polymerase binding 
site (5' AAA CGA CGG CCA GTG AAT TGT AAT ACG ACT CAC TAT AGG 
GCG (T) 3'). Second strand cDNA synthesis (producing double- 
stranded (ds) cDNA) is performed using RNase H and E. coli DNA 
polymerase I (Gubler and Hoffman, Gene 25: 263-269 (1983)). 
Following blunt-ending of cDNA with T4 DNA polymerase (at which 
point a functional T7 RNA polymerase promoter is created) , the 
cDNA is purified and transcribed with T7 RNA polymerase, 
yielding amplified, anti-sense RNA. 

DETAILED DESCRIPTION OF THE SPECIFIC EMBODIMENTS 



In accordance with the present invention, novel 



methods and compositions are provided for the amplification of 
broad classes of cDNAs * In typical embodiments, as shown m 
Figure 1, these methods involve the incorporation of an RNA 
polymerase promoter into selected cDNA molecule by priming cDNA 
synthesis with a primer complex comprising a synthetic 
oligonucleotide containing the promoter. Following synthesis 
of double-stranded cDNA , a polymerase generally specific for 
the promoter is added, and anti-sense RNA is transcribed from 
the cDNA template. The processive synthesis of multiple RNA 
molecules from a single cDNA template results in amplified, 
anti-sense RNA (aRNA) that serves, inter alia , as starting 
material for cloning procedures using random primers. The 
amplification, which will typically be at least about 20-40, 
typically to 50 to 100 or 250-fold, but may be 500 to 1000-fold 
or more, can be achieved from nanogram quantities or less of 
cDNA, and is economical, simple to perform under standard 
molecular biology laboratory conditions (see, Maniatis et al., 
(1982), Molecular Cloning: A Laboratory Manual , Cold Spring 
Harbor, which is incorporated herein by reference) . It is also 
easily adaptable into kit form. 

The techniques of the present invention also provide 
a number of additional genetic manipulation technologies. The 
aRNA represents a useful intermediate for construction of 
complex cDNA libraries from extremely limited amounts of 
tissue, such as individual brain nuclei, tissue sections, and 
even single cells. Also, with appropriate amplification 
primers, aRNA synthesis can be used for the production of 
specific ribonucleotide probes without prior cDNA cloning into 
riboprobe vectors. Further, aRNA provides a source of large 
amounts of single-stranded, anti-sense material for use as 
driver in subtractive hybridization. For example, two nucleic 
acid populations, one sense, and one anti-sense, can be allowed 
to mix together with one population present in molar excess 
(driver) , Sequences present in both populations will form 
hybrids, whereas sequences present in only one population 
remain single-stranded (Duguid et al . , Proc. Natl. Acad. Sci. 
USA 85: 5738-5742 (1988)). 

In one general embodiment of the present invention, 



cDNA strands are synthesized from a collection cf mRNA ' s using 
an oligonucleotide primer complex, i.e., a primer linked to a 
promoter region. If the target mRNA is the entire mRNA 
population, then the primer can oe a polythymidy late region 
(e.g., about 5 to 20, preferably about 10 to 15 T residues), 
which will bind with the poly (A) tail present on the 3 1 
terminus of each mRNA. Alternatively, if only a preselected 
mRNA is to be amplified, then the primer will be substantially 
complementary to a section of the chosen mRNA, typically at the 
3' terminus. The promoter region is located upstream of the 
primer at its 5 1 terminus in an orientation permitting 
transcription with respect to the mRNA population utilised. 
This will usually; but not always, mean that the promoter DNA 
sequence operanly linked to the primer is the complement to the 
functional promoter sequence. When the second cDNA strand is 
synthesized, the promoter sequence will be in correct 
orientation in that strand to initiate RNA synthesis using that 
second cDNA strand as a template. Preferably, the promoter 
region is derived from a prokaryote, and more preferably from 
the group consisting of SP6 , T3 and T7 phages (Chamberlin and 
Ryan, in The Enzymes , ed. P. Boyer (Academic Press, New York) 
pp. 87-108 (1982), which is incorporated herein by reference). 
A preferred promoter region is the sequence from the T7 phage 
that corresponds to its RNA polymerase binding site (5 1 AAA CGA 
CGG CCA GTG AAT TGT AAT ACG ACT CAC TAT AGG GCG I 1 ) . 

Once the oligonucleotide primer and linked promoter 
region hybridize to the mRNA , a first cDNA strand is 
synthesized. This first strand of cDNA is preferably produced 
through the process of reverse transcription, wherein DNA is 
made from RNA, utilizing reverse transcriptase following 
standard techniques. This enzyme, present in all retroviruses 
(e.g., avian myeloblastoma virus), adds deoxyribonucleotides to 
the 3' terminus of the primer (Varmus, Science 240 : 1427-1435 
(1988) ) . 

The second strand cDNA, creating double-stranded (ds) 
cDNA, can be synthesized by a variety of means, but preferably 
with the addition of RNase H and E. coii DNA polymerase. RNase 
assists breaking the RNA/first strand cDNA hybrid, and DNA 



polymerase synmesizes a complementary DNA sirana from the 
template DNA strand. The secona strand is generated as 
deoxynucieotides are added to the 3 1 terminus of the growing 
strand. As the growing strand reacnes the 5 ! terminus of the 
first strand DNA, the complementary promoter region of the 
first strand will be copied into the double stranded promoter 
sequence m the desired orientation. 

Thereafter, cDNA is transcribed into anti-sense RNA 
(aRNA) by introducing an RNA polymerase capaole of binding to 
the promoter region. The second strand of cDNA is transcribed 
into aRNA , which is the complement of the initial mRNA 
population. Amplification occurs because the polymerase 
repeatedly recycles on the template fi.e., reinitiates 
transcription from the promoter region) . Importantly, this 
particular technique thus enables replicating a broad range of 
cDNA's without prior cloning into vectors. Recycling of the 
polymerase on the same template avoids propagation of errors. 

The RNA polymerase used for the transcription must be 
capable of operably binding to the particular promoter region 
employed in the primer complex. A preferred RNA polymerase is 
that found in bacteriophages, in particular T3 and T7 phages. 
Substantially any polymerase/promoter combination can be used, 
however, provided the polymerase has specificity for that 
promoter in vitro sufficient to initiate transcription. 

The most preferred polymerase is T7 RNA polymerase. 
The extremely high degree of specificity shown by T7 RNA 
polymerase for its promoter site (Chamberlin et al., in The 
Enzymes , ed. P. Boyer (Academic Press, New York) pp. 37-108 
(1982)) has previously made this enzyme a useful reagent in a 
variety of recombinant DNA techniques, including in vitro RNA 
synthesis from plasmids containing the promoter site for use as 
probes (Melton et al . , Nuci. Acids Res., 12: 7035-7056 (1984)), 
for in vitro translation studies (Krieg et al., Muc. Acids Res. 
12: 7057-7070 (1984)), and for use in producing synthetic 
oligoribonucieotides (Milligan et al., Nuc . Acids Res. 1_5: 
8783-8798 (1987)). The natural T7 promoters share a highly 
conserved sequence covering about bp -17 to about -6 relative 
to the start of the RNA chain (Dunn and Studier, J. Mol . Biol. 



166 : 477-535 (1983) and J . Mol • Biol. 175 : 111-112 (1984)/ both 
of which are incorporated herein by reference) . The lack of 
efficient termination signals for T7 polymerase also enable it 
to make transcripts from almost any DNA (see, Rosenberg et al., 
Gene 56: 125-135 (1987), which is incorporated herein by 
5" reference) . The T7 polymerase is also available from a number 
of commercial sources, such as Promega Biotech, Madison, WI, 
and in a concentrated form (1000 units/ from Epicenter 
Technologies, Madison, WI, Other suitable polymerases include, 
by way of example and not limitation: T3 or SP6 (Promega) RNA 

10 polymerase and E.coli RNA polymerase, and will vary in 
accordance with the promoter region (s) utilized. 

Depending on the ultimate use of the aRNA, included 
in the transcription reaction mixture will be the necessary 
nucleotide triphosphates, where one or more of the nucleotides 

15 may be labeled, particularly with a radioactive label, such as 
32 S, 32 P, 3 H, or the like. Alternatively, nucleotides may be 
obtained which are labeled with biotin, where these nucleotides 
will become incorporated in the aRNA (see, Sive and St. John, 
Nucl. Acids Res. 1_6: 10937 (1988) and Duguid et al., Proc. 

20 Natl. Acad. Sci. USA 85: 5738-5742 (1988) both of which are 
incorporated herein by reference) . The biotin may then be used 
for binding to avidin, which is labeled with an appropriate 
label capable of providing for detection. A wide variety of 
labelling techniques are well known to those skilled in the art 

25 and may be used in accordance with standard procedures ( see , 
U.S. Patent No. 4,755,619, which is incorporated herein by 
reference) . 

In general, when characterizing the aRNA transcripts, 
various well known procedures may be employed. The transcripts 

30 ma y be removed from the reaction mixture and separated on an 
appropriate gel. The aRNA may be used as template for cDNA 
synthesis and subjected to PCR to further expand desired 
sequences. The aRNA may be used unmodified for further 
cloning, expression, use as probe or driver nucleic acid in 

35 subtractive hybridization, in Cot or Rot analysis, or the like. 

Essentially any nucleic acid sequence, in purified 
or nonpurified form, can be utilized as the starting nucleic 
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acid(s) for the methods of the present invention, provided it 
contains or is suspected to contain the specific nucleic acid 
sequence desired (i.e., complementary to the primer). It is 
only generally preferred that a sufficient number of bases at 
one end of the sequence be known in sufficient detail so that 
a primer template can be prepared which will hybridise to one 
of the strands of the desired sequence. A mixture of any of 
these nucleic acids may also be employed (including specific or 
degenerate sequences, see, e.g., PCR U.S. patents), or the 
nucleic acid produced from a previous amplification reaction 
using the same or different primers may be utilized. The 
specific nucleic acid sequence to be amplified may be only a 
fraction of a larger molecule or can be present initially as a 
discrete molecule, so that the specific sequence constitutes 
the entire nucleic acid 

It is also not necessary that the sequence to be 
amplified be present initially in a pure form; it may be a 
minor fraction of a complex mixture, or a portion of a nucleic 
acid sequence, the existence of which is due to a particular 
microorganism's presence. The starting nucleic acid sequence 
may contain more than one desired specific nucleic acid 
sequence, each of which may be the same or different from each 
other. Therefore, the amplification process is useful not only 
for producing large amounts of one specific nucleic acid 
sequence, but also for amplifying simultaneously more than one 
different specific nucleic acid sequence located on the same or 
different nucleic acid molecules. 

The nucleic acid(s) may be obtained from any source, 
for example, from plasmids such as pBR322, from cloned DNA or 
RNA, or from natural DNA or RNA from any source, including 
bacteria, yeast, viruses, organelles, and higher organisms such 
as plants or animals. DNA or RNA may be extracted from blood, 
tissue material or cells by a variety of techniques such as 
those described by Maniatis et al . , supra . 

As used herein, the term "primer complex" refers to 
an oligonucleotide having two components: (l) a primer that 
may be synthetic or a purified restriction digest and (2) 
a specifically oriented RNA polymerase promoter region. The 



primer component will be capable of acting as a point: of 
initiation of synthesis, typically DNA replication, when placed 
under conditions m which synthesis of a primer extension 
product that is complementary to a nucleic acid strand is 
induced, i.e., in the presence of appropriate nucleotides and 
z a replicating agent (e.g. , a DNA polymerase) under suitable 
conditions, wnich are well known in the art. 

The primer is preferably a single stranded 
oligodeoxynucleotide. The primer must be sufficiently long to 
act as a template for the synthesis of extension products in 

10 the presence of the replicating agent. The exact lengths of 
the primers and the quantities used will depend on many 
factors, including temperature, degree of homology and other 
conditions. For example, when amplifying a specific sequence, 
the oligonucleotide primer typically contains between about 10 

15 and 50 nucleotides, preferably 15-25 or more nucleotides, 
although it may contain fewer nucleotides, depending, e.g., on 
the promoter sequence. For other applications, the 

oligonucleotide primer is typically, but not necessarily, 
shorter, e.g., 7-15 nucleotides. Such short primer molecules 

20 generally require cooler temperatures to form sufficiently 
stable hybrid complexes with template. 

The oligonucleotide primers may be prepared using any 
suitable method, such as, for example, the well known 
phosphotriester and phosphodiester methods, or automated 

25 embodiments thereof. In one such automated embodiment, 
diethylphosphoramidites are used as starting materials and may 
be synthesized as described by Beaucage et al., Tetrahedron 
Letters 22: 1359-1962 (1981). One method for synthesizing 
oligonucleotides on a modified solid support is described in 

30 U.S. Patent No. 4,458,066. It is also possible to use a primer 
which has been isolated from a biological source (such as a 
restriction endonuclease digest) . 

The primers herein are selected to be "substantially" 
complementary to the different strands of each specific 

J 5 sequence to be amplified, i.e., the primers should be 
sufficiently complementary to hybridize with their respective 
strands. Therefore, the primer sequence need not reflect the 



exact sequence of the template, and can, in fact, be 
"degenerate . u Non-complementary bases or longer sequences can 
be interspersed into the primer, provided that the primer 
sequence has sufficient complementarity with the sequence of 
the strand to be amplified to permit hybridization and 
extension. 

Generally, it is not necessary to know the sequences 
of a target mRNA, as the primer may be a poly(T) of sufficient 
length to hybridize with substantially an entire population of 
messages (i.e., poly(T) , wherein n is typically from about 5 to 
50 or more) . Of course, when more sequence knowledge is 
available for a target RNA, the primer may be designed more 
specifically, which greatly increases the efficiency of the 
amplification. Moreover, the primer may actually comprise a 
collection of primer sequences, such as where more than one 
target sequence exists. Also, if there is ambiguity in the 
sequence information, a number of primers should be prepared. 
In particular, when any of several possible nucleic acid 
sequences encoding a protein could be correct based on 
polypeptide sequence obtained from a fragment of the protein, 
a collection of primers containing sequences representing most 
or all of the possible codon variations (utilizing codon 
degeneracy) can be prepared. 

Similarly, the promoter region will be capable of 
initiating transcription of an operably linked DNA sequence in 
the presence of ribonucleotides and an RNA polymerase under 
suitable conditions. The primer and promoter components will 
be linked in an orientation to permit transcription of a DNA 
strand that is complementary to the primer, i.e., the a RNA 
transcription will generally be in the same direction as the 
primer extension. A linker oligonucleotide between the 
components, if present, will typically comprise between about 
5 and 20 bases, but may be smaller or larger as desired. 

The second component of the primer complex, the 
promoter region, may be single stranded or double stranded. 
The region will usually comprise between about 15 and 250 
nucleotides, preferably between about 25 and 60 nucleotides, 
from a naturally occurring RNA polymerase promoter or a 



consensus promoter region (Alberts et al., in Molecular Biology 
of the Cell, 2d Ed., Garland, NY (1989), which is incorporated 
herein by reference). In general, prokaryotic promoters are 
preferred over eukaryotic promoters, and phage or virus 
promoters most preferred. As used herein, the term "operably 
linked" refers to a functional linkage between the affecting 
sequence (typically a promoter) and the controlled sequence. 

Promoter sequences are regions where RNA polymerse 
bind tightly to the DNA and contain the start site and signal 
for RNA synthesis to begin. In E.coli, typically the RNA 
polymerase molecules covers about 60 nucleotides when it binds 
to the DNA. Native strong promoters typically contain two 
highly conserved DNA sequences, each about six nucleotides 
long, which are located upstream from the start site and 
separated from each other by about 17 nucleotides of 
unrecognized DNA . When promoters are compared, the regions of 
similarity are termed "consensus" sequences. 

A wide variety of promoters and polymerases showing 
specificity for its cognate promoter are known. For example, 
the Bacillus subtilis phage SP01 synthesizes two or three new 
a factors which, when combined with host RNA polymerase, 
recognize only SP01 promoters. Probably the best studied phage 
promoter/polymerase system is E.coli phage T7 . T7 makes an 
entirely new polymerase that is highly specific for the 17 late 
T7 promoters. Rather than having two separate highly conserved 
regions like E.coli promoters, the late T7 promoters have a 
single highly conserved sequence from -17 to +6 relative to the 
RNA start site. The Salmonella phage SP6 is very similar to 
T7. The simplicity of the T3 , T7 , and SP6 systems have made 
them especially amendable to the expression of heterologous DNA 
in genetic engineering applications. 

At least one example is known of an RNA polymerase 
which recognizes a single-stranded promoter. E.coli phage N4 
makes an RNA polymerase which recognizes early N4 promoters on 
native single stranded N4 DNA. For a detailed description of 
promoters and RNA synthesis upon DNA templates, see Watson et 
a1 -/ Molecular Biology of The Gene r 4th Ed., Chapters 13-15, 
Benjamin/Cummings Publishing Co., Menlo Park, CA, which is 



14 

incorporated herein by reference. 

As used herein the term n anti-sense orientation" 
means that the promoter will initiate transcription of the 
complementary sequence to the target sequence (for example, 
when mRNA is the target sequence of the primer, anti-sense RNA 
will be produced) . 

The high specificity of bacterial phage RNA 
polymerases for their cognate promoters, in particular, 
provides a novel subtractive hybridization paradigm, although 
other polymerase and promoters may be used. Because aRNA is 
anti-sense with respect to the input RNA, aRNA produced from 
RNA population #1 (using cDNA synthesized with, e.g., the T7 
promoter-containing primer and T7 RNA polymerase) can be 
hybridized with RNA population #2, Unhybridized poly (A) + RNA 
is then be transcribed into cDNA using reverse transcriptase 
and a primer containing a different RNA polymerase promotor 
(such as T3 or SP6) . Following second strand synthesis, these 
cDNAs are then be amplified with the addition of the 
appropriate RNA polymerase, yielding a high specific activity 
probe which is used to screen an appropriate cDNA or genomic 
library. Such a technique has an advantage over most other 
subtractive hybridization protocols in that physical separation 
of single and double stranded material is not required to 
generate the subtractive probe; additionally, the resultant 
aRNA can be radioactively labeled in excess of 1 x 10 9 cpm/^ig 
(see, Melton, supra ) , permitting screening of a large number of 
clones with a limited amount of subtractive material. 

Although the aRNA paradigms of the present invention 
will provide a useful adjunct to PCR in a wide variety of 
diagnostic or other studies, especially facilitated are studies 
of gene expression in essentially any mammalian cell or cell 
population. Although the cell may be from blood (e.g., white 
cells, such as T or B cells) a typical source of cell or tissue 
will be solid organs, such as brain, spleen, bone, heart, 
vascular, lung, kidney, liver, pituitary, endocrine glands, 
lymph node, dispersed primary cells, tumor cells, or the like. 
Thus, in the neural research area, identification of mRNAs 
which vary as a function of arousal state, behavior, drug 



treatment, and development, for example, has been hindered by 
both the difficulty in construction of cDNA libraries from 
small brain nuclei and in the relative spatial insensitivity of 
subtractive hybridization techniques. Use of the aRNA 
amplification method in construction of cDNA libraries from 
individual brain nuclei will provide for greater representation 
of low-abundance mRNAs from these tissues compared with their 
representation in whole brain cDNA libraries, and facilitate 
cloning of important low-abundance messages. 

The RNA amplification technology can also be applied 
to improve methods of detecting and isolating nucleic acid 
sequences that vary in abundance among different populations, 
such as in comparing mRNA expression among different tissues or 
within the same tissue according to physiologic state. Methods 
for examining differential expression typically involve 
subtractive hybridization, wherein two nucleic acid 
populations, one sense and one anti-sense, are allowed to mix 
with one another. One population is present in molar excess 
("driver") such that sequences represented in both populations 
form hybrids, whereas sequences present in only one population 
remain single-stranded. Thereafter, various well known 

techniques are used to separate the unhybridized molecules 
representing differentially expressed sequences. 

Most methods of subtractive hybridization require 
that large amounts (generally tens of micrograms) of nucleic 
acid be available for use as "driver" in such experiments. 
This limits usefulness in examining differential expression of 
mRNAs present in a biological material that is available in 
small supply. This is overcome by cloning the nucleic acid 
populations of interest prior to subtraction, so that the 
cloning vector is used to amplify the amount of nucleic acid 
available for hybridization. However, because subtraction 
requires previous cloning, it is complicated, suffers from 
under- and over- representation of sequences depending on 
differences in growth rates in the mixed population, and may 
risk recombination among sequences during propagation of the 
mixed population. The aRNA technology of the present invention 
circumvents these problems by allowing production of large 
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amounts of anti-sense RNA from limited amounts of nucleic acid, 
without the need for previous cloning. PCR produces both 
sense- and anti-sense strands, so is less favorable for this 
use . 

An additional embodiment of the present invention is 
5 "differential amplification 1 '. This procedure allows 

amplification of only differentially expressed sequences by 
virtue of the specificity among prokaryotic RNA polymerase 
promoters . 

Also, the novel aRNA technology is used to identify 

10 sequences of interest from small amounts of nucleic acid by 
detecting the amplified sequences in the aRNA product. The 
aRNA amplification is uniform among individual sequences, and 
thus, it is useful in estimating relative levels of 
representation of a given sequence relative to other sequences 

15 within a population. Such quantitative resolution finds use in 
molecular diagnostics (such as in diagnosing thalassemias 
characterized by abnormal levels of hemoglobin gene 
expression) , where diagnosis can rest not on absolute presence 
or absence of a sequence, but on whether a given sequence is 

20 present at abnormally high or low levels relative to other 
sequences. By providing relatively linear amplification, the 
present invention offers advantages to PCR. 

Anti-sense RNA has a wide variety of uses in both 
analytical research and therapeutics. Anti-sense RNA functions 

2 5 in several prokaryotic systems to regulate gene expression. 
Similarly, anti-sense RNA can regulate the expression of many 
eukaryotic genes. This permits blocking expression of 
undesirable genes. Therapeutic use of anti-sense RNA therefore 
involves in vitro synthesis of anti-sense RNA with subsequent 

30 introduction into the subject ( see , generally, Melton, 
Antisense RNA and DNA , Cold Spring Harbor (1988), which is 
incorporated herein by reference) . 

Another application of the inventive aRNA technology 
is in detection of variant regions flanking a common sequence, 

^5 such as for molecular diagnostics. By designing an 

amplification primer that recognizes a commonly shared 
sequence, aRNA is produced that contains not only the common 
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region recognized by the primer, but also 5'- flanking sequence 
useful in detecting sequence variants. An advantage over PGR 
is that only one region of shared sequence need be known; PCR 
generally requires that shared sequences be known both 5'- and 
3'- to the region of interest, and that these flanking regions 
be sufficiently close to allow efficient amplification. Thus, 
for example, aRNA can be produced from limited amounts of 
clinical material to allow pathogen-specific sequences (such as 
those distinguishing viral types) to be identified, genetic 
polymorphisms to be detected, or alternate splicing variants to 
be characterized, all in accordance with standard techniques. 

The materials for use in the methods of the present 
invention are ideally suited for preparation of kits produced 
in accordance with well known procedures. Such a kit may 
comprise containers, each with one or more of the various 
reagents (typically in concentrated form) utilized in the 
methods, including, for example, buffers, the appropriate 
nucleotide triphosphates (e.g., dATP , dCTP, dGTP and dTTP; or 
rATP, rCTP, rGTP and UTP) , reverse transcriptase, DNA 
polymerase, RNA polymerase, and one or more primer complexes of 
the present invention (e.g., appropriate length poly(T) or 
random primers linked to a promoter reactive with the RNA 
polymerase) . A set of instructions will also typically be 
included. 

The following experimental examples are offered by 
way of illustration and not by way of limitation. 
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EXPERIMENTAL 

Materials 

Avian myeloblastoma reverse transcriptase was 
obtained from Seikagaiku, St. Petersburg, Florida; RNase H, DNA 
polymerase I, T4 DNA polymerase, and all restriction enzymes 
were from Bethesda Research Laboratories, Bethesda, Maryland; 
SI nuclease was from Boehringer Mannheim, West Germany; RNase 
Block and T3 RNA polymerase were obtained from Stratagene, La 
Jolla, CA; T7 RNA polymerase (80 units/^1) was obtained from 
Promega, Madison, WI ; and radioactive nucleotide triphosphates 
were from Amersham, Arlington Heights, IL. Oligonucleotides 
were synthesized on a BioSearch Model 8600 synthesizer and were 
purified by acrylamide gel electrophoresis prior to use. The 
rat 0-actin clone in the pCD vector was isolated by 
hybridization to the published sequence (Nudel et al., Nuc. 
15 Acids Res. 11: 1756-1771 (1983)). pGEM-2 plasmids containing 
alpha subunit sequences of G , G , G . G . G „ were prepared 

^ s o jl i i2 i3 

according to Jones et al . , J. Biol. Chem. 262 : 1421-14249 
(1987). IB15 cDNA ( cyclophil in ) cloned into the pCD vector. was 
prepared according to Sutcliffe et al., Cold Spring Harbor 
Symposia on Quantitative Biology 48* 477-484 (1983). 
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cDNA synthesis 

Total RNA was extracted from whole rat cerebellum by 
guanidium thiocyanate extraction followed by cesium chloride 

25 gradient ultracentrif ugation (Chirgwin et al., Biochemistry 7A\ 
5294-5299 (1987)). 40 \iq of total RNA was primed with lOOng of 
primer (5' AAA CGA CGG CCA GTG AAT TGT AAT ACG ACT CAC TAT AGG 
CGC (T) 3*) as schematized in Figure 1. The RNA/primer mix 
was subjected to three cycles of heat denaturation at 80 °C 

30 alternating with incubation on ice for 3 minutes each. First 
strand synthesis was performed with avian myeloblastoma virus 
reverse transcriptase, and second strand cDNA was synthesized 
using RNase H and E. coli DNA polymerase I (Gubler et al,, Gene 
25 : 263-269 (1983)). cDNA was made blunt ended by treatment 
with 2 units T4 DNA polymerase for 15 minutes at 37 °C. 
Unincorporated triphosphates were removed by drop dialysis 
against double-distilled H 2 0 for 2 hours, using a 0.025 mm 



nitrocellulose filter (Millipore, Bedford, MA) . Identical 
reactions were performed for each synthesis with and without 
incorporation of 30 uCI of 400 Ci/mM a- 32 P dCTP. 

aRNA synthesis 

3 ng of cDNA was amplified in each reaction. 
Reactions conditions were: 40 mM Tris pH 7.5, 6 mM MgCl 2 , 10 mM 
NaCl, 2 mM spermidine, 10 mM DTT, 500 mM each ATP, GTP, and 
UTP, 12.5 fiM CTP, 30 uCI a- 32 P-CTP (3000 Ci/mM). 10 units RNase 
Block, and 80 units T7 RNA polymerase in a volume of 20 m1. 
Reactions were carried out for 2 hours at 37 °C. Incorporated 
radioactivity was determined by TCA precipitation and 
scintillation counting of Cerenkov radiation (Maniatis et al., 
supra ) . 

In Situ Transcription (1ST) and amplifications 

Fresh frozen cerebellar tissue was cut by cryostat in 
a horizonal plane into 11 fj.m thick sections and processed for 
in situ transcription (1ST) as previously described (Tecott et 
al., Science 240 : 1661-1664 (1988)). Oligonucleotide 
hybridization proceeded for 12 hours followed by washing for 5 
hours in 0.5XSSC at room temperature. The 1ST reaction 
conditions were the same as described in Eberwine et al . , 
Neuroscience Short Course I (Society for Neuroscience ) 69-81 
(1988) , with the autoradiographic signals generated using a 
ration of 250 nM 32 P dCTP/2.5 uM dCTP in the reaction. The cDNA 
used in amplifications was synthesized by 1ST using 250 uM each 
of dATP, dGTP, TTP and dCTP. After the 1ST, cDNA transcripts 
were isolated (Tecott et al . , Science 240 : 1661-1664 (1988)), 
and second strand cDNA was synthesized as described (Maniatis 
et al., supra) . The hairpin-loop structure was removed by Sl- 
nuclease treatment for 5 minutes. Subsequently the cDNA was 
blunt-ended using T4 DNA polymerase. After phenol/chloroform 
extraction and ethanol precipitation with 5 micrograms of 
carrier tRNA, cDNA was dissolved in 10 microliters of H 2 0 and 
drop dialyzed for 4 hours against H 2 0. The cDNA was removed and 
2 uL was used for each aRNA amplification. Amplifications were 
performed as previously described with the exception that the 
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concentration of nonlabeled CTP in the reaction was 1.25 uM 
rather than 12.5 uM. 

Single cell injections and amplifications 

Primary cultures of cerebellar neurons were 
established from rat embryos at gestation day 20, using 
techniques described in Gruol et al . , J. Neurosci. 7: 1271-1293 
(1987) . Cultures consisting of Purkinje neurons, cerebellar 
interneurons, and glia were grown on polylysine in modified 
Eagles medium with 10% added serum. Identified mature Purkinje 
cells at 21 days in vitro were selected for cDNA synthesis. 
The whole-cell patch clamp recording technique (Yool et al., J. 
Neurosci. 8: 1971-1980 (1988)) was used to introduce the 
components for first strand synthesis intracellularly , while 
continuously monitoring cell physiology. Polished, uncoated 
patch electrodes (3-5 MOhm) were filled with 8-10 M l of the 
reaction mix, consisting of buffered saline (154 mM KC1, 6 mM 
NaCl, 5 mM MgCl 2 , 0.1 M M CaCl 2 , It) mM Hepes pH 7.3), reverse 
transcriptase (2 units/Ml), T7 55 oligonucleotide primer (0.5 
ng/Ml), and 1 mM each of dATP , dCTP, dGTP, TTP. The duration 
of loading and incubation in the whole-cell recording 
configuration ranged from 5-20 min. Simultaneous monitoring of 
electrical properties of the cell (sustained spontaneous 
firing, resting membrane potential near -60 mV, and input 
resistance >100 MOhm) indicated the loading process did not 
impair functions that are sensitive to damage. 

The soma of each cell was harvested after loading 
with suction applied through the electrode holder after opening 
the electrode tip by gently touching it against the culture 
plate. Separate electrodes were used for each cell. Harvested 
soma were incubated individually in the electrodes for 1 hr at 
37 °C to facilitate first strand synthesis, then ejected from 
the electrodes into EDTA (10 mM) in two volumes of ethanol, and 
frozen on dry ice. The samples were processed by addition of 
5 iiq of tRNA and NaCl to a final concentration of 500 mM and 
centrifuged to precipitate the nucleic acids. Residual 
cellular protein was removed by phenol/chloroform extraction 
and the nucleic acid concentrated and further purified by two 
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ethanol precipitations. The nucleic acid was dissolved in 25 
Ml of H 2 0 and heated at 95°C for 3 minutes followed by quick 
cooling on ice to separate the cDNA from RNA . To make second- 
strand cDNA, the volume was increased to 50 jxl with 2 x 
concentrated DNA polymerase buffer (Maniatis et al., supra ) and 
incubated with 10 units of DNA polymerase at 37 °C for 60 
minutes. This mixture was phenol/chloroform extracted followed 
by two ethanol precipitations. The DNA was treated with 1 unit 
of SI nuclease for 5 minutes (Maniatis et al., supra ) to remove 
the hairpin-loop structure and the sample was phenol/chloroform 
extracted and ethanol precipitated. The ends of the DNA were 
blunt-ended by incubation of the DNA with 5 units of T4 DNA 
polymerase and 200 m each of the deoxynucleotide triphosphates 
(Maniatis et al., supra ) at 37 °C for 15 minutes followed by 
phenol/chloroform extraction and ether extraction. This DNA 
mixture was drop dialyzed against 50 ml of H 2 0 for 4 hours. The 
DNA was then concentrated to 10 jxl under vacuum. Between 3 and 
5 fil of this material was used in each amplification reaction. 
aRNA amplification was done essentially as described above with 
the exception that nonlabeled dCTP was used at a final 
concentration of 1.25 and, when using 3 /il of DNA template, 
the amount of 32 P-labeled dCTP was increased to 50 uCi. 

Northern and Southern analyses 

Swiss-Webster mouse and Sprague-Dawley rat genomic 
DNA were isolated as described (Maniatis et al., supra ) . 20 ^g 
of each species DNA was digested overnight with 20 units of the 
following enzymes: BamHI, EcoRI, PstI . The DNA was 
electrophoresed on a 1% agarose gel and blotted to 
nitrocellulose (Maniatis et al . , supra ) . pCD-actin was 
restriction enzyme digested with 10 units PvuII. G G G 

S ' O ' Li ' 

G i2 , and G i3 containing plasmids were digested with restriction 
enzymes as follows: G : EcoRI and BamHI; G : EcoRI and EcoRV; 
G^: EcoRI and Xbal ; G EcoRI and Sau96; and G. : EcoRI and 
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EcoRV. These restriction enzyme digestions yield DNA fragments 
that have been used as specific probes for individual G protein 
alpha subunits (Jones et al,, J. Biol. Chem. 262: 14241-14249 
(1987)). pCD-IB15 was digested with BamHI (Sutcliffe et al . , 



supra ) . Restriction digested plasmids (10 each for all G- 
protein plasmids and actin, and 5 ug for IB15) were 
electrophoresed on a 1% agarose gel and transferred to 
nitrocellulose as for the genomic DNA. Northern blots of whole 
mouse brain and rat cerebellar RNA (15 }jlG each) were made by 
blotting 1.2% agarose/3% formaldehyde gels to nitrocellulose in 
20X SSC buffer (Maniatis et al., supra ) . Size standards (1.4 
kb and 0,7 kb) for Northern blots were provided by the in vitro 
transcription of linearized positive control pGEM-2 plasmid 
with T7 RNA polymerase. Southern and Northern transfers were 
pre-hybridized overnight at 42 "C in: 50% formamide, 6X SSC, 5X 
Denhardt's solution, and 100 jiq/ixl sheared, autoclaved salmon 
sperm DNA. For hybridization, 2.5 x 10 6 cpm of aRNA was added 
to the bag and incubated in the same buffer for 48 hours at 
42°C. Blots were washed in 0.1X SSC/0.2% SDS for 1 hour at 
42 'C and exposed to Kodak XAR film at -80°C with a Cronex 
intensifying screen (DuPont, Wilmington, DE) for suitable time 
periods . 

General characteristics of aRNA amplification 

For the general characteristics of aRNA 
amplification, see , Melton et al., Nuc. Acids Res. 12.: 7035- 
7056 (1984), which is incorporated herein by reference, which 
utilized in vitro transcription from plasmid vectors using SP6 
RNA polymerase. The degree of amplification obtained was 
strongly dependent on the enzyme concentration and incubation 
time. Using the same amount of template (3 ng) and varying the 
amount of T7 RNA polymerase from 10 units to 80 units provided 
a 4-fold improvement in yield of aRNA. Maximum levels of 
amplification were obtained from incubation times between 2 and 
4 hours; for incubations longer than 4 hours, TCA precipitable 
radioactivity began to decrease. In optimizing the reaction, 
it was found that increased UTP concentration (to 1 mM) during 
amplification increases aRNA synthesis, while inclusion of 
glycogen (used as a nonspecific carrier during cDNA 
precipitation) inhibits the aRNA reaction. Additionally, 
excess primer acts as an inhibitor of the reaction. 

In order to assess the role of additional sequence 



located 5' to the T7 promoter site in aRNA production 
(potentially necessary to stabilize the enzyme-DNA 
interaction) , amplifications were performed with synthetic 
primer lengths of 38, 57, and 80 nt, identical except for 
additional 5' sequence derived from Bluescript plasmid. 
Although cDNA generated from all three primers amplified 
equally well, the 57 nt primer (termed T7 5? ) gave the best yield 
of cDNA and was used for all subsequent amplification 
reactions. Additionally, cDNA primed with a T3 RNA polymerase 
promoter site (55 nucleotides long containing a 3' poly dT 
tract of 15 bases) was amplified with T3 RNA polymerase. This 
promoter-polymerase combination also generates a similar size 
distribution of amplified material, but yields approximately 
75% of TCA-precipitable radioactivity as compared to equal 
units of T7 RNA polymerase. Using cDNA synthesized from total 
cerebellar RNA with the T7 5? primer, an 80-fold molar 
amplification was achieved as measured by TCA-precipitable 
radioactivity . 

Amplification of a broad range of mRNA sequences from total 
cerebellar RNA 

Because the 5' promoter sequence is specific for T7 
RNA polymerase (Chamberlin et al., J. Biol. Chem. 248 : 2235- 
2244 (1973) and Chamberlin et al., in The Enzymes . ed . P. Boyer 
(Academic Press, New York) pp. 87-108 (1982)) and the RNA 
polymerase is capable of producing transcripts of 7 kb or 
larger (Melton et al . , supra ) , it was expected that the aRNA 
produced would represent accurately the size and complexity of 
the synthesized cDNA. To test this, cDNA was synthesized from 
total cerebellar RNA using the T7 5? primer. A nonlabeled 
portion of this cDNA was then amplified with T7 RNA polymerase 
in the presence of a- 32 P CTP. The size distribution of cDNA and 
aRNA were very similar. 

To characterize further the aRNA produced, Southern 
blot analysis of mouse and rat genomic DNA were performed using 
labeled aRNA derived from rat cerebellum as a probe. The aRNA 
hybridized to a broad distribution of sequences, indicating 
considerable sequence heterogeneity in the amplified material. 



Additionally, the aRNA showed marked sequence specificity for 
rat genomic DNA over that of mouse. A similar experiment was 
performed using aRNA to probe a Northern blot of whole rat 
cerebellar RNA. Again, the aRNA bound to a wide range of 
sequences indicating sequence heterogeneity in the amplified 
material. Approximately 30% of the total grain density of the 
bound material was found in the 18S and 28S ribosome bands. 
This is consistent with the yield of cDNA corresponding to 
these RNA species when cDNA is made from total RNA, suggesting 
that aRNA abundance is representative of the parent cDNA « 

In order to quantify the complexity of amplified 
material, R Q t hybridization analysis (Van Ness et al . , Nuc. 
Acids Res. 10: 8061-8077 (1982)) of the aRNA was performed. 
Although a variety of hybridization and RNase conditions were 
tested, it was not possible to adequately reduce the non- 
hybridized background of the aRNA; 40% of total single-stranded 
aRNA remained TCA-precipitable after 500 Mg/ml RNase A 
treatment in 0 . 5M NaCl at 37'C for 30 minutes. This material 
was fully digested following sample boiling. Thus, the 
heterogeneous amplified material has considerable secondary and 
tertiary structure. 

Detection — of specific sequences in aRNA synthesized from 

nanogram quantities of cDNA 

In order to be useful in the detection and 
characterization of regulated transcripts, an amplification 
method must be able to amplify low-abundance transcripts. To 
detect such transcripts, aRNA was generated from 3 ng of 
cerebellar cDNA which had been primed with the T7 5? primer. The 
aRNA was labeled to a specific activity of 5 x 10 8 cpm/^g. 2.5 
x 10 6 cpm were used to probe 10 /xg each of plasmids containing 
inserts for rat actin, and the alpha subunits of rat G-proteins 
G s' G o' G u' G l2 ' and G l3 * The vect °r sequences in each lane 
serve as an internal non-specific hybridization control. 
Following washing of filters in 0.1X SSC/0.1% SDS at 42°C, 
strong signals were visible in the actin and G lanes. Other G 

s 

protein mRNAs were not visible using this exposure time; 
however, since the G t and G o -protein alpha subunits share 
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considerable homology, this may be due to dilution of the aRNA 
for these G-protein alpha subunits among the multiple DNA 
restriction fragments encoding these sequences. In order to 
test this possibility, the presence of particular messages in 
the aRNA was assayed by probing slot-blots of aRNA and parent 
cDNA with random hexamer-primed probes for actin and the G- 
proteins. Qualitatively identical results were found for both 
cDNA and aRNA: a very strong signal was discerned for the actin 
probe in both aRNA and cDNA , a weaker signal was observed for 
G , and very weak signals were discerned for G , G , and G 

o li x3 

Detection of specific aRNA transcripts from cerebellar tissue 
sections 

The determined 1ST autoradiographic signal suggested 
that cDNA synthesis occurred in many cell types, with high 
signal density in the granule cell layer of the cerebellum. 
This result reflects the expected increase in amount of A+ -mRNA 
in regions of high cellular density. In parallel experiments, 
this signal was comparable to the cDNA signal generated using 
oligo-dT (36 bases long) as a primer and is at least an order 
of magnitude greater than the signal generated from background 
non-specific 1ST signals produced from endogenous primer- 
template complexes. The aRNA produced from IST-generated cDNA 
contains sequences for IB15 (band at approximately 680 
nucleotides) and actin was also determined. The variations in 
hybridization intensities reflect differences in the abundance 
of aRNAs encoding each of these molecules, as well as the 
differing amounts of radioactivity incorporated into aRNA for 
longer mRNAs versus shorter mRNAs . Background is minimal as 
shown by the lack of hybridization of aRNA to the pCD vector 
in the IB15 sample and the lack of binding to the DNA ladder. 
The actin and G s signals on the autoradiogram correspond to DNAs 
of 3 kilobases in length and longer, because the cDNAs encoding 
these proteins were not separated from the plasmid vector by 
restriction enzyme digestion and hence appear in multimer forms 
of the supercoiled plasmid. 



aracterization of ampl ification products from a single cell 



A schematic of the microinjection technique used to 
introduce the T7 57 -oligonucleotide into individual Purkinje 
cells was prepared. The cerebellar primary culture provides 
access to identified Purkinje cells throughout their 
development to maturity by three weeks in vitro . The mature 
stage of an individual Purkinje cell is characterised by 
extensive dendritic outgrowth, expression of Purkin j e-cell 
specific immunohistochemical markers, and endogenous pacemaker 
firing activity properties (Gruol et al., supra ) comparable to 
Purkinje neurons of similar age in vivo . 

The amount of radiolabel incorporated into aRNA 
obtained from single cell amplifications from individual 
Purkinje cells ranged from 30,000 to 300,000 cpm. The reasons 
for this variability are unclear, but may result from different 
amounts of first strand cDNA synthesis, variable success in 
regeneration of functional T7 RNA polymerase promoter site 
during the preparation of the cDNA template or partial 
degradation of the T7 5? -oligonucleot ide (Wickstrom, J. Biochem. 
Biophys. Methods 13: 97-102 (1986)). This low number of 
radioactive counts precluded the use of single cell aRNA as a 
probe to screen for specific low abundance mRNAs , yet was 
sufficient to serve as a probe for cerebellar RNA to gauge the 
complexity of the amplified material. The aRNA probe 
hybridized to poly (A) + and total RNA isolated from the rat 
cerebellum. The specificity of this RNA signal is evident in 
the lack of hybridization of aRNA to E. coli total RNA and 
pBluescript DNA. 

In addition to amplifying an entire population of A'- 
mRNA using a single primer complex, degenerate primers have 
been utilized to amplify specific sequences for a specific 
class of mRNAs. In attempts to clone receptors that are part 
of the G-protein coupled receptor family, a T7-amplif ication 
oligonucleotide primer which uses a sequence common to all of 
these receptors was synthesized as the primer (rather than 
oligo-dT i5 which will prime a majority of mRNAs). The primer 
was used to prime cDNA synthesis from mRNA isolated from NG108- 
15 cells. This first strand cDNA was then processed as 
described above. The population of resultant aRNA molecules 
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was less "complex" than that made using the T7-57 primer and 
contains sequences for the receptors of interest. As is the 
case for hybridization of specific probes to mRNA in Northern 
analyses, the generation of aRNAs specific to certain mRNAs is 
only limited by the specificity of the sequences used to prime 
cDNA synthesis. For degenerate primers, reference is made to 
Gould et al., Proc. Natl. Acad. Sci. U.S.A. 86: 1934-1938 
(1989), which is incorporated herein by reference. 

From the foregoing, it will be appreciated that novel 
techniques have been devised for the in vitro amplification of 
heterogeneous, complex populations of cDNA by generating cDNA 
with a primer containing a phage RNA polymerase promoter 
sequence. The technique is capable of greater than 80 to 1000 
or more-fold molar amplification of starting cDNA. Amplified, 
anti-sense RNA (aRNA) produced by these techniques can be 
prepared that closely matches the size distribution of the 
parent cDNA. The amplified material contains heterogeneous 
sequences as determined by binding of aRNA to genomic Southern 
blots and specific Northern blots; additionally, detection of 
specific messages of various abundances in the amplified 
material can be performed. 

Moreover, the spectrum of aRNA produced during 
amplification qualitatively reflects the population of cDNA 
from which it is produced. Since relative amounts of 
individual sequences present in cDNA represent (to a first 
approximation) their relative abundances in the transcribed RNA 
population and since aRNA production appears to be linear, the 
amount of a specific RNA in an aRNA population should reflect 
its abundance in the original RNA population. Given this, 
'results from both aRNA-probed Southern blots and slot-blots of 
aRNA and cDNA probed with actin and G-protein cDNAs suggest 
that actin mRNA is considerably more abundant in rat cerebellum 
than any of the G-alpha mRNAs tested. of greater interest is 
the finding that G-alpha produced a substantially higher 
hybridization signal than signals from the other G-alpha 
sequences tested. This is surprising because G-alpha protein 
levels are lower than those of other G-alpha "subunits in a 
variety of tissues (Gilman, Ann. Rev. Biochem. 56: 615-649 
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(1987)), including brain (Gierschik et al . , Proc . Natl. Acad. 
Sci. USA 8_3: 2258-2262 (1986)). Such high relative abundance 
of G s ~alpha mRNA has, however, been detected in other brain 
regions by in situ hybridization (Largent et al. , Proc. Natl. 
Acad. Sci. USA 85: 2864-2868 (1988)) as well as in the NG108 
neuronal cell line. These results are consistent with the 
hypothesis that unique control of regulation for G s -alpha in 
rates of protein turnover and/or mRNA translation may exist 

(Largent et al . , supra ) . 

Although the foregoing invention has been described 
in some detail by way of illustration and example for purposes 
of clarity of understanding, it will be apparent that certain 
changes and modifications may be practiced within the scope of 
the appended claims. 



WE CLAIM: 



1* A process for amplifying at least one target 
nucleic acid sequence comprising: 

(a) synthesizing a nucleic acid by hybridizing a 
primer complex to the sequence and extending the primer to form 
a first strand complementary to the sequence and a second 
strand complementary to the first strand, wherein the complex 
comprises a primer complementary to the sequence and a promoter 
region in an anti-sense orientation with respect to the 
sequence; and 

(b) transcribing copies of anti-sense RNA off of the 
second strand. 



2. A process according to Claim 1, wherein the 
primer complex comprises a promoter region from a T 
bacteriophage . 



3. A process according to Claim 1, wherein the 
bacteriophage is T7 or T3 . 



4. A process according to Claim l, wherein the 
target nucleic acid sequence is mRNA. 

5. A process according to Claim 1, wherein the RNA 
sequence has a poly (A) tail. 

6. A process according to Claim 5, wherein the 
primer complex comprises a poly(T) primer sequence, 

7. A process for detecting expression of a gene in 
a preselected cell population comprising: 

(a) synthesizing double-stranded complementary 
deoxyribonucleic acid (cDNA) by treating mRNAs from the cell 
population with a primer complex comprising an oligonucleotide 
complementary to one or more of the RNA sequences, the primer 
linked to a promoter region in an orientation capable of 
directing transcription of anti-sense RNA; 



(b) transcribing the cDNA into anti-sense RNA by 
introducing an RNA polymerase capable of operably binding to 
the promoter region; and 

(c) determining the presence or absence of transcribed 
anti-sense RNA complementary to mRNA corresponding to the gene. 

8. A process according to Claim 7, wherein the cell 
population is from a human tissue. 

9. A process according to Claim 7, wherein the 
tissue library is brain nuclei. 

10. A process according to Claim 7 , wherein the cell 
population comprises about 100 cells. 

11- A process for producing a subtractive 

hybridization probe comprising: 

(a) synthesizing a first double-stranded cDNA collection 
by treating a first mRNA population with a primer complex, 
wherein the primer is complementary to the RNA sequence and is 
operably linked to a first promoter region for transcription of 
the cDNA strand complementary to the primer; 

(b) transcribing the first cDNA into anti-sense RNA by 
introducing a first RNA polymerase capable of binding to the 
first promoter region; 

(c) hybridizing the anti-sense RNA to a second mRNA 
population, whereby an unhybridized subpopulation of the second 
RNA population is found; 

(d) generating a second double-stranded cDNA collection 
from the unhybridized subpopulation using a second primer 
complex comprising a second promoter region in an orientation 
for transcribing anti-sense RNA complementary to the 
unhybridized subpopulation; and 

(e) transcribing the second cDNA into a ribonucleotide 
probe by introducing a second RNA polymerase capable of binding 
to the second promoter region. 



A method for making a cDNA library from a 



collection of mRNA molecules comprising: 

(a) hybridizing one or more primer complexes to a 
plurality of the mRNA's, wherein each complex comprises an 
oligonucleotide primer linked to a promoter sequence capable of 
directing transcription of a DNA sequence complementary to the 
primer; 

(b) producing a collection of double-stranded cDNAs 
by extending the primers of a plurality of any hybridization 
duplexes formed between the mRNA 1 s and the complexes wherein 
each cDNA comprises a first strand complementary to one mRNA 
molecule and a second strand operably linked to the promoter 
sequence ; 

(c) transcribing multiple copies of anti-sense RNA 
off of the second strand; and 

(d) preparing a cDNA library from the anti-sense RNA 

copies . 

13. A method utilizing a promoter for amplifying a 
nucleic acid sequence comprising: 

(a) binding to the nucleic acid sequence a primer 
operably linked to a promoter sequence capable of initiating 
transcription of RNA complementary to the nucleic acid 
sequence ; 

(b) synthesizing complementary DNA by elongation from 
the primer; 

(c) extending the complementary DNA to generate a 
functional promoter; and 

(d) adding RNA polymerase to initiate RNA synthesis 
from the promoter, whereby multiple copies of the nucleic acid 
sequence are produced . 

14. A method according to Claim 13, wherein the 
promoter and polymerase are a cognate pair. 

15. A method according to Claim 13, wherein the 
promoter is a naturally occurring prokaryotic promoter. 



16. A method according to Claim 13, wherein the 
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promoter is a bacteriophage promoter. 



17. A method according to Claim 13, wherein the 
promoter is a T3, T5 , or SP6 promoter. 

5 18. A method according to Claim 13, wherein 

complementary DNA is synthesized by reverse transcriptase. 

19. A method according to Claim 13, wherein the 
nucleic acid is RNA. 

10 

20. A method according to Claim 19, wherein the RNA 
sequence has a poly (A) tail. 

21. A method according to Claim 19, wherein the 
15 primer comprises a poly(T) sequence. 

22. A method for subtractive hybridization 
comprising the steps of: 

(a) binding a primer to sense RNA molecules in a 
population, wherein the primer is operably linked to a promoter 
sequence in an anti-sense orientation; 

(b) synthesizing complementary DNA by elongation from 
said primer; 

(c) extending the complementary DNA to generate a 
25 functional promoter; 

(d) initiating RNA synthesis from the promoter by 
adding RNA polymerase, whereby anti-sense RNA is produced; 

(e) introducing the anti-sense RNA in molar excess to 
a second population of sense RNA molecules, whereby 
substantially complementary RNA sequences from the two 
populations hybridize; and 

(f) isolating remaining single-stranded sense RNA, 
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23. A method according to Claim 22, wherein the 
amplified RNA is labeled during synthesis. 



24. A method for detecting the expression of a gene 



in one or more cells, said method comprising the steps of: 

(a) binding a primer to RNA molecules in a 
population, wherein the primer is operably linked to a promoter 
sequence in an anti-sense orientation; 

(b) synthesizing complementary DNA by elongation from 
said primer; 

(c) extending the complementary DNA to generate a 
functional promoter ; 

(d) initiating RNA synthesis from the promoter by 
adding RNA polymerase, thereby producing amplified RNA; 

(e) analyzing the amplified RNA to determine 
expression of the gene. 

25. The method according to Claim 24, wherein the 
amplified RNA is analyzed by hybridizing to a DNA sequence 
substantially corresponding to the gene to detect RNA copies of 
the gene. 

26. A method according to Claim 25, wherein the DNA 
sequence is present on a Southern blot. 

27. A method according to Claim 24, wherein the RNA 
is radioactively labeled during synthesis. 

28. A method according to Claim 27, wherein the cell 
is obtained from human tissue. 

29. A method according to Claim 24, wherein the 
cells comprise about 100 cells. 

30. A method for amplifying mRNA in single brain 
cells comprising: 

(a) introducing into the brain cells a reaction 
mixture comprising primer operably bound to promoter sequence 
in anti-sense orientation, reverse transcriptase, dATP, dCTP, 
dGTP, and dTTP thereby generating first strand cDNA; 

(b) harvesting brain cell soma; 

(c) precipitating nucleic acids from the brain cell 
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soma ; 

(d) synthesizing second-strand cDNA by the addition 
of DNA polymerase; 

(e) removing hairpin-loop structures in the cDNA by 
treating with SI nuclease; 

(f) synthesizing a functional promoter by adding T4 
DNA polymerase, dATP, dCTP, dGTP and dTTP; 

(g) initiating transcription by adding T7 RNA 
polymerase, rATP, rCTP, rGTP and UTP; whereby anti-sense RNA is 
produced . 

31. A component kit for the production of amplified 
RNA comprising: a container comprising a primer complex 
component and instructions for use of said primer complex. 

32. A component kit according to Claim 31, further 

comprising 

(a) a container comprising a reverse transcriptase; 

(b) a container comprising an RNA polymerase; 

(c) a container comprising dATP, dCTP, dGTP, and dTTP 
nucleotides; or 

(d) a container comprising rATP, rCTP, rGTP, and UTP 
nucleotides. 



33. A component kit according to Claim 31, wherein 
the primer complex comprises a mixture of nucleic acid 
sequences. 



34. A component kit according to Claim 31, wherein 
the primer complex comprises a poly(T) primer capable of 
hybridizing to a collection of mRNA molecules. 

35. A composition comprising a collection of primer 
complexes, wherein a primer in one primer complex is poly(T) , 
wherein n is from about 5 to 50, or a population of degenerate 
primers . 
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ABSTRACT OF THE DISCLOSURE 

This invention relates to the use of promoters for 
ribonucleic acid amplification and other genetic manipulations. 
Processes are provided wherein complementary deoxyribonucleic 
acid (cDNA) is synthesized from a ribonucleic acid (RNA) 
sequence using a complementary primer linked to an RNA 
polymerase promoter region complement and then anti-sense RNA 
(aRNA) is transcribed from the cDNA by introducing an RNA 
polymerase capable of binding to the promoter region. 
Additional processes using the resulting aRNA are also 
described. 
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